RAW SEQUENCE LISTTNC 
ERROR REPORT 



BIOTECHNOLOGY JSlfl'_ „ 
SYSTEMS ' ' "' ' '"' ^ 

BRANCH 




I^!° teCbll0l0gy SyStemS Branch of the Scientific Technical Information Center 
(J>TIC) detected errors when processing the following CRF diskette: 



Application Serial Number: 
Art Unit / Team No. . /(>% 
Date Processed by STIC: 



3 Mn 



THE ATTACHED PRINTOUT EXPLAINS THE ERRORS DETECTED. 
PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMPLY NICATI ° N T ° APPLICANTS AL °NG WITH A NOTICE TO 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY ° U 1 

^^LL^E^ERROR^F^EE^^ NEXT SUBMISSION RECEIVED FROM THEM 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
ARTI SHAH 703-308-4212 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 



Wrapped Nucleics 



The number/text at the end of each line ■wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Wrapped Aminos 



Incorrect Line Length 



Misaligned Amino Acid 
Numbering 



The amino acid number/text at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The rules require that a line not exceed 72 characters in length. This includes spaces. 
All text must be visible on page. 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and uses spacing between the numbers. 



Non-ASCII 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 



Variable Length 



Sequence(s) . 



. contain n*s or Xaa's which represented more than one residue. 



As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) features section that some may be missing. 



Wrong Designation 



Sequence(s) contain amino acid or nucleic acid designators which are not standard 

representations as per the Sequence Rules (Please refer to paragraph 1 .822) 



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence Is intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 



10 Use of n's or Xaa's 

(NEW RULES) 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



1 1 Use of <213>Organism 

(NEW RULES) 



Sequence(s) . 



. are missing this mandatory field or its response. 



12 Use of <220> Feature 

(NEW RULES) 



Sequence's). 



. are missing the <220>Feature and associated headings. 



Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register/ 6/01/98, Vol. 63 t No. 104, pp. 29631-32) 
(Sec. 1 .823 of new Sequence Rules) 



13 



Patentln ver. 2.0 "bug" 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager" or any other means to copy file to floppy disk. 



1646 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 
TIME: 



03/22/1999 
13:50:51 



INPUT SET: S31123.raw 



— > 



— > 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 



SEQUENCE LISTING 



(1) 



General Information: 



7 



(i) APPLICANT : Hauptman, 

(ii) TITLE OF INVENTION: 



(iii) NUMBER OF SEQUENCES 



(iv) CORRESPONDENCE ADDRESS 

( A ) ADDRESSEE : John J. McDonnell 

( B ) STREET : 300 S. Wacker Drive 

( C ) CITY : Chicago 

( D ) STATE : IL 

( E ) COUNTRY : USA 

(F) ZIP:60606 




7 



PROTEINS, AND DNAs 

jut 




(v) 
(A) 
(B) 
(C) 
(D) 



(vi) CURRENT APPLICATION DATA: 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS/MS- DOS 
SOFTWARE: ASCI I 

(A) APPLICATION NUMBER: 08/£$£7312^ 

(B) FILING DATE : June 7, 1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: John J. McDonnell 

(B) REGISTRATION NUMBER: 26,94 9 

(C) REFERENCE/ DOCKET NUMBER : 98 , 3 85 -A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-913-0001 

(B) TELEFAX:312-913-9808 




ERRORED SEQUENCES FOLLOW: 



3 9 (2) INFORMATION FOR SEQ ID NO:l: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 03/22/1999 
TIME: 13:50:52 



— > 



40 
41 
42 

43 
44 
45 
46 
47 
48 
49 
50 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1365 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



INPUT SET: S3 1123. raw 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATGGGCCTCT CCACCGTGCC TGACCTGCTG CTGCCACTGG TGCTCCTGGA 50 GCTGTTGGTG GGAATATACC 



— > 



51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATAGTGTGT GTCCCCAAGG AAAATATATC CACCCTCAAA ATAATTCGAT 



/ 



50 TTGCTGTACC AAGTGCCACA A 



63 (2) INFORMATION FOR SEQ ID NO: 3: 
64 

65 (i) SEQUENCE CHARACTERISTICS: a * i * 



— > 66 (A) LENGTH 

— > 67 (B) TYPE 

68 

— > 69 (Xi) SEQUENCE DESCRIPTION: SEQ I 

70 



TH:4 55 amino acids , Ajurn*- M> * & ***** ^ ,/ I * 

CD NO:3: u (JUp^y^ 



71 


Met 


Gly 


Leu 


Ser 


Thr 


Val 


Pro 


Asp 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Leu 


72 










5 










10 










15 


73 


Leu 


Glu 


Leu 


Leu 


Val 


Gly 


He 


Tyr 


Pro 


Ser 


Gly 


Val 


He 


Gly 


Leu 


74 










20 










25 










30 


75 


Val 


Pro 


His 


Leu 


Gly 


Asp 


Arg 


Glu 


Lys 


Arg 


Asp 


Ser 


Val 


Cys 


Pro 


76 










35 










40 










45 


77 


Gin 


Gly 


Lys 


Tyr 


He 


His 


Pro 


Gin 


Asn 


Asn 


Ser 


He 


Cys 


Cys 


Thr 


78 










50 










55 










60 


79 


Lys 


Cys 


His 


Lys 


Gly 


Thr 


Tyr 


Leu 


Tyr 


Asn 


Asp 


Cys 


Pro 


Gly 


Pro 


80 










65 










70 










75 


81 


Gly 


Gin 


Asp 


Thr 


Asp 


Cys 


Arg 


Glu 


Cys 


Glu 


Ser 


Gly 


Ser 


Phe 


Thr 


82 










80 










85 










90 


83 


Ala 


Ser 


Glu 


Asn 


His 


Leu 


Arg 


His 


Cys 


Leu 


Ser 


Cys 


Ser 


Lys 


Cys 


84 










95 










100 










105 


85 


Arg 


Lys 


Glu 


Met 


Gly 


Gin 


Val 


Glu 


He 


Ser 


Ser 


Cys 


Thr 


Val 


Asp 


86 










110 










115 










120 


87 


Arg 


Asp 


Thr 


Val 


Cys 


Gly 


Cys 


Arg 


Lys 


Asn 


Gin 


Tyr 


Arg 


His 


Tyr 


88 










125 










130 










135 


89 


Trp 


Ser 


Glu 


Asn 


Leu 


Phe 


Gin 


Cys 


Phe 


Asn 


Cys 


Ser 


Leu 


Cys 


Leu 


90 










140 










145 










150 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:52 



INPUT SET: S31123.mw 



Q 1 
7 1 


Asn Gly Thr 


i; a 1 
val 


til S 


Leu 


Ser 


Cys 


pi r\ 

vain 


ulu 


Lys 


uin 


Asn 


i nr 


Val 


7 Z 








IRS 










i fin 










ID J 


Q ^ 
7 O 




1 111. V/JfO 


til s 


Ala 


pi «t 

ijiy 


rile 


rne 


Leu 


Arg 


fin 
blU 


Asn 


ulU 


Cys 


Val 


7 TE 








J. / u 










1 

± / J 










J. o V 


Q C 


Ser 


Cys Ser 


Asn 


Cys 


Lys 


Lys 


Ser 


Leu 


pi n 


Cys 


Tnr 


Lys 


Leu 


Cys 


7 D 








ID J 










1?U 










1 Q C 


7 / 


Leu 


Pro Gin 


lie 


olU 


Asn 


vai 


Lys 


oiy 


Thr 


Pin 

ulU 


Asp 


Caw 


p i 
biy 


i nr 


Q Q 
70 








z uu 










one 
Z U D 










Z1U 


Q Q 

7 7 


Thr 


Val Leu 


Leu 


Pro 


Leu 


vai 


lie 


rfle 


rile 


p1 tr 

biy 


Leu 


Cys 


Leu 


Leu 


ion 
1 u u 








Z 1 D 










z z u 










Z Z j 


1 A1 

1U 1 


Ser 


Leu Leu 


rile 


Tl A 

lie 


ni 

uiy 


Leu 


Wa4- 


Tyr 


Arg 


Tyr 


bin 


Arg 


Trp 


Lys 


1 fi 0 


















o ^ r 
z j j 










9 a n 


1U j 


Ser 


Lys Leu 


Tyr 


Ser 


Tl — 

lie 


vai 


Cys 


biy 


Lys 


Ser 


Thr 


Pro 


pin 

bill 


Lys 


1 U4 








z4 0 










OCA 

ZO U 










Z D D 


1 A c 

1UO 


Glu Gly Glu 


Leu 


(jIU 


(jiy 


Thr 


Thr 


Thr 


Lys 


Pro 


Leu 


IV 1 »s 

Aia 


Pro 


Asn 


i a c 
1 U o 








OCA 

Z oil 










OCR 
ZD D 










OTA 

z / u 


107 


Pro 


Ser Phe 


Ser 


Pro 


Thr 


Pro 


uiy 


pne 


Tnr 


Pro 


Thr 


Leu 


Gly 


pne 


i f» ft 
1 U o 








O "7 C 

Z / o 










O ft A 
Z oU 










O ft c 

zoo 


1 A A 

109 


Ser 


Pro Val 


Pro 


Ser 


Ser 


Thr 


Pne 


Thr 


Ser 


Ser 


Ser 


Thr 


Tyr 


Thr 


110 








O O A 

Z 90 










IOC 

z95 










O A A 


111 


Pro 


Gly Asp Cys 


Pro 


Asn 


Pile 


Ala 


Ala 


Pro 


Arg 


Arg 


GlU 


Val 


Ala 




















0 1 A 










QIC 

jIj 


113 


Pro 


Pro Tyr 


Gin 


Gly 


Ala 


Asp 


Pro 


lie 


Leu 


Ala 


Thr 


Ala 


Leu 


TV 1 ~ 

Ala 


114 








J z u 










IOC 

JZD 










J JU 


115 


Ser 


Asp Pro 


He 


Pro 


Asn 


Pro 


Leu 


Gin 


Lys 


Trp 


Glu 


Asp 


Ser 


TV "I _ 

Ala 


TIC 

lib 








J J 3 










Oil A 










"3 A C 


117 


His 


Lys Pro 


Gin 


Ser 


Leu 


Asp 


Thr 


Asp 


Asp 


Pro 


Ala 


Thr 


Leu 


Tyr 


lift 
1 lo 








OCA 










JO J 












119 


Ala 


Val Val 


Glu 


Asn 


val 


Pro 


n w> a. 

pro 


Leu 


Arg 


Trp 


Lys 


G1U 


pne 


v«1 

vai 


i 1 a 
lzU 








ice 










OTA 










one 


1 Z 1 


Arg 


Arg Leu Gly 


Leu 


Ser 


Asp 


nlS 


pin 

blU 


lie 


Asp 


Arg 


Leu 


pin 

blU 


Leu 


1 z z 








1 ft A 










*3 ft C 












TOO 

lz J 


Gin Asn Gly Arg 


Cys 


Leu 


Arg 


/•till 

blU 


» 1 — 

Aia 


Pi m 

uin 


Tyr 


Ser 


Mex 


Leu 


Aia 


124 








395 










400 










405 


125 


Thr 


Trp Arg 


Arg 


Arg 


Thr 


Pro 


Arg 


Arg 


Glu 


Ala 


Thr 


Leu 


Glu 


Leu 


± Z D 








din 










air 










*x Z \J 


127 


Leu 


Gly Arg 


Val 


Leu 


Arg 


Asp 


Met 


Asp 


Leu 


Leu 


Gly 


Cys 


Leu 


Glu 


128 








425 










430 










435 


129 


Asp 


He Glu 


Glu 


Ala 


Leu 


Cys 


Gly 


Pro 


Ala 


Ala 


Leu 


Pro 


Pro 


Ala 


130 








440 










445 










450 


131 


Pro 


Ser Leu 


Leu 


Arg 






















132 








455 






















133 






























134 


(2) 


INFORMATION 


FOR 


SEQ 


ID N0:4 

















-> 
-> 



--> 



135 
136 
137 
138 
139 
140 
141 
142 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

( B ) TYPE : polypeptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:52 



1 j o 

14 3 










c 
D 


144 


Ser 


He 


Cys 


Cys 


Thr 


1 A C 

14b 










ZD 


146 


Asp 


Cys 


Pro 


Gly 


Pro 


147 










3 5 


148 


Ser 


Gly Ser 


Phe 


Thr 


149 










50 


150 


Ser 


Cys 


Ser 


Lys 


Cys 


151 










65 


152 


Ser 


Cys 


Thr 


Val 


Asp 


Id J 










oU 


154 


Gin 


Tyr 


Arg 


His 


Tyr 


155 










95 


156 


Cys 


Ser 


Leu 


Cys 


Leu 


157 










110 


158 


Lys 


Gin 


Asn 


Thr 


Val 


159 










125 


160 


Glu 


Asn 


Glu 


Cys 


Val 


161 










140 


162 


Cys 


Thr 


Lys 


Leu 


Cys 


163 










155 


164 





















10 




Lys 


Cys 


His 


Lys 


Gly 
25 


Thr 


Gly 


Gin 


Asp 


Thr 


Asp 
40 


Cys 


Ala 


Ser 


Glu 


Asn 


His 
55 


Leu 


Arg 


Lys 


Glu 


Met 


Gly 
70 


Gin 


Arg 


Asp 


Thr 


Val 


Cys 
85 


Gly 


Trp 


Ser 


Glu 


Asn 


Leu 
100 


Phe 


Asn 


Gly 


Thr 


Val 


His 
115 


Leu 


Cys 


Thr 


Cys 


His 


Ala 
130 


Gly 


Ser 


Cys 


Ser 


Asn 


Cys 
145 


Lys 


Leu 


Pro 


Gin 


He 


Glu 
160 


Asn 



INPUT SET: S31123.mw 









1 R 

1 D 


Tyr 


Leu 


Tyr 


Asn 








30 


Arg 


Glu 


Cys 


Glu 








45 


Arg 


His 


Cys 


Leu 








oU 


Val 


Glu 


He 


Ser 








/5 


Cys 


Arg 


Lys 


Asn 








90 


Gin 


Cys 


Phe 


Asn 








105 


Ser 


Cys 


Gin 


Glu 








120 


Phe 


Phe 


Leu 


Arg 








135 


Lys 


Ser 


Leu 


Glu 








150 



165 (2) INFORMATION FOR SEQ ID NO: 5: 
166 

167 (i) SEQUENCE CHARACTERISTICS: 

— > 168 (A) LENGTH: 157 bases 

169 (B) TYPE:nucleic acid 

170 (C) STRANDEDNESS: single 

171 (D) TOPOLOGY: linear 
172 

173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
174 

175 CAGGGGAAA ATATTCACCC TCAAATAAT TCGATTTGCT GTACCAAGTG 50 CCACAAAGG AAACTACTTG TA 

176 



-> 
-> 



--> 



177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 aminp_acids 

( B ) TYP ^fpol ypep 1 1 d e 



ECS : 

^ oM CP) <T*#>"*# ; ^^r— 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Ser Val Xaa Pro Gin Gly Lys Tyr He His Pro Gin 

5 10 



■> 
■> 



188 
189 
190 
191 
192 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 annjxo^ acids 

(B) TYPEVrpolype^tidj 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:53 



INPUT SET: S3 1123. raw 



193 

— > X94 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

195 

196 Glu Asn Glu Cys Val Ser Cys Ser Asn Cys Lys 

197 5 10 
198 



199 (2) INFORMATION FOR SEQ ID NO: 8: 
200 

201 (i) SEQUENCE CHARACTERISTICS: 

— > 202 
--> 203 

204 

— > 205 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

206 

207 Glu Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys 

208 5 10 
209 



(A) LENGTHj_12 amjjUL acids * A A/\Jis 

(B) TYPE ^polyp^tide > Ay^J^''^ 



210 (2) INFORMATION FOR SEQ ID NO: 9: 
211 

212 (i) SEQUENCE CHARACTERISTICS 

— > 213 (A) LENGTH: 13 am^e-a^ids 

— > 214 (B) TYPpTpolype^tide 

215 

— > 216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

217 

218 Tyr lie His Pro Gin Xaa Asn Ser lie Xaa Xaa Xaa Lys 



!: ty 



219 
220 



ixc nxo r-x. u uxu aoci a=3U •aei 11c Add Acta Acta bys ( f 

221 Misinformation for seq id noiio: . . % /Q^L/yi-JL' 

223 (i) SEQUENCE CHARACTERISTICS: ~T 

— > 224 (A) LENGTH: 14 amjj*o^acids 

— > 225 (B) TYPEf^olypeptide^ 

226 

— > 227 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

228 

22 9 Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn Asn Lys 

230 5 10 
231 

— > 232 (2) INFORMATION FOR SEQ ID NO: 11: 

233 

234 (i) SEQUENCE CHARACTERISTICS: 

— > 235 (A) LENGTH: 15 amin&_gcids 

— > 236 (B) TYPE,*Firlypep£ide^ 

237 

--> 238 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

239 

240 Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp Arg 

241 5 10 15 
242 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 03/22/1999 
TIME: 13:50:53 



INPUT SET: S3U23.mw 





243 




244 








Oil £ 












Oil a 




o <; 
/ 3 U 




Z 0 1 




ZoZ 




O C Q 

Z D J 




254 




255 




256 


> 


257 


> 


258 




259 


> 


260 




261 




262 




263 




264 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amijioacids 

(B) TYP ^polypeptide^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly Thr Tyr Ley Tyr Asn Asp Cys Pro Gly Pro Gly Gin 

5 10 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 a mino acids 

(B) TYl^ Efpb lypejrt i 3eNy ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Glu Met Gly Gin Val Glu lie Ser Xaa Xaa Xaa Val Asp 

5 10 





265 




266 




267 


— > 


268 


— > 


269 




270 


— > 


271 




272 




273 




274 




275 




276 




277 




278 




279 




280 


— > 


281 


— > 


282 




283 


— > 


284 




285 




286 




287 




288 




289 




290 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amyio. -acids 

(B) TY^ EfTpo lypep t i de ^^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp Arg 

5 10 15 

Asp Thr Val Cys Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 ami»e--acids 



(A) LENGTH: 19 am^o-^cii 

( B ) TYP^Tp&lypeptide^^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr lie His Pro Gin Xaa Asn Ser lie Cys Cys Thr Lys Cys His 

5 .10 15 

Lys Gly Xaa Tyr 



291 (2) INFORMATION FOR SEQ ID NO: 16: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
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INPUT SET: S3 1123. raw 





z y z 




z y j 
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294 
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295 




Z Z70 


> 


o r% *y 








O Q Q 

z y y 
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1 A 1 

JU 1 




JU z 




O A O 




304 




305 




306 


> 


307 


> 


308 




309 


> 


310 




311 




312 




313 




314 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYP^ rpoly^eptid e^) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr 

5 10 15 

Xaa Xaa Arg 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino— ft€k\ds 

(B) TYPB^rlypeptide J 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Gin Asn Thr Val Cys Thr Xaa His Ala Gly Phe Phe Leu Arg 

5 10 





315 


(2) INFORMATION FOR SEQ ID NO: 18: 








316 










317 


(i) SEQUENCE CHARACTERISTICS: 






— > 


318 


(A) LENGTH: 14 am>»e--*£ids 






— > 


319 

320 


( B ) TYPjfrfpol ypep t i de_3 






— > 


321 

322 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 








323 


Ser Leu Glu Cys Thr Lys Leu Cys Leu Pro Gin 


He 


Glu Asn 




324 


5 10 








325 










326 


(2) INFORMATION FOR SEQ ID NO: 19: 








327 










328 


(i) SEQUENCE CHARACTERISTICS: 






— > 


329 


(A) LENGTH: 13 amino— aqids 






— > 


330 

331 


(B) TYPE^po^ypeptide^) 






— > 


332 

333 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 








334 


Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His 


Pro 


Gin 




335 


5 10 








336 










481 


(2) INFORMATION FOR SEQ ID NO: 32: 








482 










483 


(i) SEQUENCE CHARACTERISTICS: 






— > 


484 


(A) LENGTH: 14 amino acids 







PAGE: 8 RAW SEQUENCE LISTING DATE: 03/22/1999 

PATENT APPLICATION US/08/383,676 TIME: 13:50:54 

INPUT SET: S3U23.raw 

— > 485 (B) TYP^: polypeptide 

486 

--> 487 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

488 

489 Glu Cys Gly Ser Gly Ser Phe Thr Ala Ser Glu Asn Asn Lys 

490 5 10 
491 

4 92 (2) INFORMATION FOR SEQ ID NO: 33: 
493 

494 (i) SEQUENCE CHARACTERISTICS: 

--> 495 (A) LENGTH: 14 amjjiou^cids 

— > 496 (B) TYPE3<^dlype^tide 

4 97 " 

--> 498 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

499 

500 Glu Cys Gly Ser Gly Ser Phe Thr Ala Ser Cys Asn Asn Lys 

501 5 10 
502 



647 


(2) INFORMATION FOR SEQ ID NO: 46: 










648 












649 


(i) SEQUENCE CHARACTERISTICS: 










--> 650 


(A) LENGTH: 51 amino acids 










--> 651 

652 


(B) TYI^polyi^ptiae^s 










— > 653 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 








654 












655 


Gin Gly Lys Tyr lie His Pro Gin Asn 


Asn 


Ser 


lie Cys 


Cys Thr 


656 


5 


10 






15 


657 


Lys Cys His Lys Gly Thr Tyr Ley Tyr 


Asn 


Asp 


Cys Pro 


Gly Pro 


658 


20 


25 






30 


659 


Gly Gin Asp Thr Asp Cys Arg Gly Cys 


Glu 


Ser 


Gly Ser 


Phe Thr 


660 


35 


40 






45 


661 


Ala Ser Glu Asn Asn Lys 










662 


50 










663 













664 (2) INFORMATION FOR SEQ ID NO: 47: 
665 

666 (i) SEQUENCE CHARACTERISTICS: 
— > 667 (A) LENGTH: 158 bases 
668 
669 

670 (D) TOPOLOGY: linear 
671 

672 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
673 

674 CAGGGGAAAT ATATTCACCC TCAAAATAAT TCGATTTCGT GTACCAAGTC 50 GCACAAAGGA ACCTACTTGT A 



(A) LENGTH: 158 bases 

(B) TYPE:nucleic acid A . J , 

(C) STRANDEDNESS: single U-&h-&\ sJ^ 1 ***^ 
(Ti\ TnpnmfiVflinMr 



675 



762 (2) INFORMATION FOR SEQ ID NO: 55: 
763 



PAGE: 9 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:54 



INPUT SET: S3U23.raw 

764 (i) SEQUENCE CHARACTERISTICS: 

— > 765 (A) LENGTH: 63 bases 

766 (B) TYPE: nucleic acid 

767 (C) STRANDEDNESS: single 

768 (D) TOPOLOGY: linear 
769 

770 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
771 

772 AGCTCTAGAG ATTCGCGGCC GCTCGAGGTA CCGGATCCAT CGATGTCGAC 50 CTGCAGAAGC TTG 
773 



774 (2) INFORMATION FOR SEQ ID NO: 56: 
775 

776 (i) SEQUENCE CHARACTERISTICS: 

— > 777 (A) LENGTH: 64 bases 

778 (B) TYPE: nucleic acid 

779 (C) STRANDEDNESS: single 

780 (D) TOPOLOGY: linear 
781 

782 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
783 

784 CTAGCAAGCT TCTGCAGGTC GACATCGATG GATCCGGTAC CTCGAGCGGC 50 CGCGAATTCT CTAG 
785 



810 (2) INFORMATION FOR SEQ ID NO: 59: 
811 

812 (i) SEQUENCE CHARACTERISTICS: 

— > 813 (A) LENGTH: 81 bases 

814 (B) TYPE:nucleic acid 

815 <C) STRANDEDNESS: single 

816 (D) TOPOLOGY: linear 
817 

818 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
819 



820 CACAGTCGAC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 50 CAAACAGACA CCATGGGCCT 
821 





870 


(2) INFORMATION FOR SEQ ID NO: 64: 






871 








872 


(i) SEQUENCE. CHARACTERISTICS: . 




— > 


873 


(A) LENGTH^O ba£es^> /<? //U/fJ 






874 


(B) TYPE : nucT^tc^acid ( Q f**^ 






875 


(C) STRANDEDNESS: single 






876 


(D) TOPOLOGY: linear 






877 








878 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 






879 








880 


GTCCAATTAT GTCACACC 


18 




881 








882 


(2) INFORMATION FOR SEQ ID NO: 65: 






883 








. 884 


(i) SEQUENCE CHARACTERISTICS: 





PAGE: 10 RAW SEQUENCE LISTING DATE: 03/22/1999 

PATENT APPLICATION US/08/383,676 TIME: 13:50:55 



INPUT SET: S31123.raw 

— > 885 (A) LENGTH: 1334 bases 

886 (B) TYPE: nucleic acid 

887 (C) STRANDEDNESS: Single 

888 (D) TOPOLOGY: linear 
889 

890 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
891 

892 GAATTCTCTG GACTGAGGCT CCAGTTCTGG CCTTTGGGGT TCAAGATCAC 50 TGGGACCAGG CCGTGAT 

893 

— > 894 TAGCTGTCTG GC 212 

895 

896 ATG GGC CTC TCC ACC GTG CCT GAC CTG CTG CTG CCA CTG GTG TTC 257 

897 Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu 

898 5 10 15 
899 

900 CTG GAG CTG TTG GTG GGA ATA TAC CCC TCA GGG GTT ATT GGA CTG 302 

901 Leu Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu 

902 20 25 30 
903 

904 GTC CCT CAC CTA GGG GAC AGG GAG AAG AG A GAT AGT GTG TGT CCC 347 

905 Val Pro His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro 

906 35 40 45 
907 

908 CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT TCG ATT TGC TGT ACC 3 92 

909 Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr 

910 50 55 60 
911 

912 AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC TGT CCA GGC CCG 4 37 

913 Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro 

914 65 70 75 
915 

916 GGG CAG GAT ACG GAC TGC AGG GAG TGT GAG AGC GGC TCC TTC ACC 482 

917 Gly Gin Asp The Asp Cys Arg Glu Cys Gly Ser Gly Ser Phe Thr 

918 80 85 90 
919 

920 GCT TCA GAA AAC CAC CTC AG A CAC TGC CTC AGC TGC TCC AAA TGC 52 7 

921 Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys 

922 95 100 105 
923 

924 CGA AAG GAA ATC GGT CAG GTG GAG ATC TCT TCT TGC AC A GTG GAC 57 2 

925 Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp 

926 110 115 120 
927 

928 CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT 617 

929 Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr 

930 125 130 135 
931 

9 32 TGG AGT GAA AAC CTTTTC CAG TGC TTC AAT TGC AGC CTC TGC CTC 66 2 

933 Trp Ser Glu Asn Leu^he Gin Cys Phe Asn Cys Ser Leu Cys Leu 

lit 14 S^^>^-^-^^ 

936 AAT GGG ACC GTG CAC CTC TCC TGC CAG GAG AAA CAG AAC ACC GTG 707 Iff 

937 Asn Gly Thr Val His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val AM A*^****? 

(am /fc^)^ 



PAGE: 1 1 RAW SEQUENCE LISTING DATE: 03/22/1999 

PATENT APPLICATION US/08/383,676 TIME: 13:50:55 

INPUT SET: S31123.mw 

938 155 160 165 
939 

940 TGC ACC TGC CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC 752 

941 Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val 

942 170 175 180 
943 

944 TCC TGT AGT AAC TGT AAG AAA AGC CTG GAG TGC AGG AAG TTG TGC 7 97 

945 Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys 

946 185 190 195 
947 

948 CTA CCC CAG ATT GAG AAT GTT AAG GGC ACT GAG GAC TCA GGC ACC 842 

94 9 Leu Pro Gin He Glu Asn Val Lys Gly Thr Glu Asp Ser Gly Thr 
950 200 205 210 
951 

952 ACA GTG CTG TTG CCC CTG GTC ATT TTC TTT GGT CTT TGC CTT TTA 887 

95 3 Thr Val Leu Leu ProLeu Val He Phe Phe Gly Leu Cys Leu Leu 

954 21S^^hf~l 220 225 

956 TCC CTC CTC TTC ATT GGT TTA ATG TAT CGC TAC CAA CGG TGG AAG 932 

957 Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 

958 230 235 240 
959 

960 TCC AAG CTC TAC TCC ATT GTT TGT GGG AAA TCG ACA CCT GAA AAA 977 

961 Ser Lys Leu Tyr Ser He Val Cys Gly Lys Ser Thr Pro Glu Lys 

962 245 250 255 
963 

964 GAG GGG GAG CTT GAA GGA ACT ACT ACT AAG CCC CTG GCC CCA AAC 1022 

96 5 Glu Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn 
966 260 265 270 
967 

968 CCA AGC TTC AGT CCC ACT CCA GGC TTC ACC CCC ACC CTG GGC TTC 1067 

969 Pro Ser Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe 

970 275 280 285 
971 

972 AGT CCC GTG CCC AGT TCC ACC TTC ACC TCC AGC TCC ACC TAT ACC 1112 

973 Ser Pro Val Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr 

974 290 295 300 
975 

97 6 CCC GGT GAC TGT CCC AAC TTT GCG GCT CCC CGC AGA GAG GTG GCA 1157 

977 Pro Gly Asp Cys Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala 

978 305 310 315 
979 

980 CCA CCC TAT CAG GGG GCT GAC CCC ATC CTT GCG ACA GCC CTC GCC 1202 

981 Pro Pro Tyr Gin Gly Ala Asp Pro He Leu Ala Thr Ala Leu Ala 

982 320 325 330 
983 

984 TCC GAC CCC ATC CCC AAC CCC CTT CAG AAG TGG GAG GAC AGC GCC 1247 

985 Ser Asp Pro He Pro Asn Pro Leu Gin Lys Trp Glu Asp Ser Ala 

986 335 340 345 
987 

988 CAC AAG CCA CAG AGC CTA GAC ACT GAT GAC CCC GCG ACG CTG TAC 12 92 

989 His Lys Pro Gin Ser Leu Asp Thr Asp Asp Pro Ala Thr Leu Tyr 

990 350 355 360 



PAGE: 12 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 03/22/1999 
TIME: 13:50:55 



INPUT SET: S3U23.raw 

991 

992 GCC GTG GTG GAG AAC GTG CCC CCG TTG CGC TGG AAGGAATTC 1334 

993 Ala Val Val Glu Asn Val Pro Pro Leu Arg Trp 

994 365 370 
995 

996 



997 (2) INFORMATION FOR SEQ ID NO: 66: 
998 

999 (i) SEQUENCE CHARACTERISTICS: 

--> 1000 (A) LENGTH: 371 aipjj&a^acids 

--> 1001 (B) TYPl^Tpolypeptide^) 

1002 ■ 

— > 1003 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

1004 

1005 Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu 

1006 5 10 15 

1007 Leu Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu 

1008 20 25 30 

1009 Val Pro His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro 

1010 35 40 45 

1011 Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr 

1012 50 55 60 

1013 Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro 

1014 65 70 75 

1015 Gly Gin Asp The Asp Cys Arg Glu Cys Gly Ser Gly Ser Phe Thr 

1016 80 85 90 

1017 Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys 

1018 95 100 105 

1019 Arg Lys Glu Met Gly Gin Val Glu He Ser Ser Cys Thr Val Asp 

1020 110 115 120 

1021 Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr 

1022 125£'<vS?^7 130 135 

1023 Trp Ser Glu Asn LeuPhe Gin" Cys Phe Asn Cys Ser Leu Cys Leu 

1024 140 145 150 

1025 Asn Gly Thr Val His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val 

1026 155 160 165 

1027 Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val 

1028 170 175 180 

1029 Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys 

1030 185 190 195 

1031 Leu Pro Gin He Glu Asn Val Lys Gly Thr Glu Asp Ser Gly Thr 

1032 200 205 210 
10 3 3 Thr Val Leu Leu PraLeu Val He Phe Phe Gly Leu Cys Leu Leu 

1034 215^0^07, 220 225 

1035 Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 

1036 230 235 240 

1037 Ser Lys Leu Tyr Ser He Val Cys Gly Lys Ser Thr Pro Glu Lys 

1038 245 250 255 

1039 Glu Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn 

1040 260 265 270 

1041 Pro Ser Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe 

1042 275 280 285 



PAGE: 13 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/081383 ,676 



DATE: 03/22/1999 
TIME: 13:50:56 



INPUT SET: S3U23.mw 

1043 Ser Pro Val Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr 

1044 290 295 300 

1045 Pro Gly Asp Cys Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala 

1046 305 310 315 

1047 Pro Pro Tyr Gin Gly Ala Asp Pro lie Leu Ala Thr Ala Leu Ala 

1048 320 325 330 

1049 Ser Asp Pro lie Pro Asn Pro Leu Gin Lys Trp Glu Asp Ser Ala 

1050 335 340 345 

1051 His Lys Pro Gin Ser Leu Asp Thr Asp Asp Pro Ala Thr Leu Tyr 

1052 350 355 360 

1053 Ala Val Val Glu Asn Val Pro Pro Leu Arg Trp 

1054 365 370 
1055 



1056 (2) INFORMATION FOR SEQ ID NO: 67: 
1057 

1058 (i) SEQUENCE CHARACTERISTICS: 

— > 1059 (A) LENGTH: 6464 bases */ 

1060 (B) TYPE:nucleic acid U jjJ ^ 

1061 (C) STRANDEDNESS: single f ^ ^ 

1062 (D) TOPOLOGY: linear Vjh^ 

1063 ' 

1064 (xi) SEQUENCE DESCRIPTION: SEQ ID' NO : 67 : / 

1065 J 

1066 TCGACATTGA TTATTGACTA GTTATTAATA GTAATCAATT ACGGGGTCAT 50 (6 

1067 TAGTTCATAG CCCATATATG GAGTTCCGCG TTACATAACT TACGGTAAAT 100 GGCCCGCCTC GCTGACCGCC C 

1068 CTGGCATTAT GCCCAGTACA TGACCTTATG GGACTTTCCT ACTIGGCAGT 350 ACATCTACGT ATTAGTCATC G 

106 9 AGAACCCACT GCTTAACTGG CTTATCGAAA TTAATACGAC TCACTATAGG 650 GAGACCCAAG CTTCTGCAGG T 

1070 AGGTTTTACT TGCTTTAAAA AACCTCCCAC ACCTCCCCCT GAACCTGAAA 1250 CATAAAATGA ATGCAATTGT T 

1071 GTTAAAAGAT GTATCCTGGA CCTGCCAGAC CTGGCCATTC ACGTAAACAG 1550 AAGATTCCGC CTCAAGTTCC G 

1072 CCCACCACCT GGCCCCGCCC CGTTGAGGAC AGAAGAAACC CCGGGCAGCC 1850 GCAGCCAAGG CGGACGGGTA G 

107 3 GCGGCCACGC CGGACTGGGC GGGGCCGGCC TGGTGGAGGC GGAGTCTGAC 2150 CTCGTGGAGG CGGGGCCTCT G 

1074 TGGCTGGATT GGGTTAGGGA AACCGAGGCG GTTCGCTGAA TCGGGTCGAG 2450 CACTTGGCGG AGACGCGCGG G 

1075 GAACGAGTTC AAGTACTTCC AAAGAATGAC CACCACCTCC TCAGTGGAAG 2750 GTAAACAGAA CCTGGTGATT A 

1076 AGGCCATCTC AGACTCTTTG TGACAAGGAT CATGCAGGAA TTTGAAAGTG 3050 ACACGTTCTT CCCAGAAATT G 

1077 GCAGAGCTAG AACTCAGACT TTAAAGAAAA TTAGATCAAA GTAGAGACTG 3650 AATTAT.TCTG CACATCAGAC T 
107 8 ATGCATCTCA ATTAGTCAGC AACCAGGTGT GGAAAGTCCC CAGGCTCCCC 4000 AGCAGGCAGA AGTATGCAAA G 
107 9 CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG CGCAGCGTGA 4255 CCGCTACACT TGCCAGCGCC C 

1080 GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA 4850 CCCAGAAACG CTGGTGAAAG T 

1081 GAAAAGCATC TTACGGATGG CATGACAGTA AGAGKATTAT GCAGTGCTGC 5150 CATAACCATG AGTGATAACA C 

1082 TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC 5450 GGCTGGCTGG TTTATTGCTG A 

1083 AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC 5750 TTAACGTGAG TTTTCGTTCC A 

1084 GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC 6100 TGCCAGTGGC GATAAGTCGT G 

1085 GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT 6400 TGAGCGTCGA TTTTTGTGAT G 
1086 



1087 (2) INFORMATION FOR SEQ ID NO: 68: 
1088 

1089 (i) SEQUENCE CHARACTERISTICS: 

— > 1090 (A) LENGTH: 2173 bases 

1091 (B) TYPE:nucleic acid 

1092 (C) STRANDEDNESS: single 

1093 (D) TOPOLOGY: linear 



PAGE: 



14 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 03/22/1999 
TIME: 13:50:56 



I 



INPUT SET: S3 1123. raw 

1094 

1095 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
1096 

1097 GAATTCCTTT TCTCCGAGTT TTCTGAACTC TGGCTCATGA TCGGGCTTAC 50 TGGATACGAG AATCCTG 

1098 

--> 1099 ATG GGT CTC CCC ATC GTG CCT GGC CTG CTG CTG TCA CTG GTG CTC 289 

1100 Met Gly Leu Pro lie Val Pro Gly Leu Leu Leu Ser Leu Val Leu 

1101 5 10 15 
1102 

1103 CTG GCT CTG CTG ATG GGG ATA CAC CCA TCA GGG GTC ACC GGA CTG 334 

1104 Leu Ala Leu Leu Met Gly lie His Pro Ser Gly Val Thr Gly Leu 

1105 20 25 30 
1106 

1107 GTT CCT TCT CTT GGT GAC CGG GAG AAG AGG GAT AAT TTG TGT CCC 37 9 

1108 Val Pro Ser Leu Gly Asp Ara Glu Lys Arg Asp Asn Leu Cys Pro 

1109 35 40 45 
1110 

1111 CAG GGA AAG TAT GCC CAT CCA AAG AAT AAT TCC ATC TGC TGC ACC 424 

1112 Gin Gly Lys Tyr Ala His Pro Lys Asn Asn Ser lie Cys Cvs Thr 

1113 50 55 60 
1114 

1115 AAG TGC CAC AAA GGA ACC TAC TTG GTG AGT GAC TGT CCA AGC CCA 469 

1116 Lys Cys His Lys Gly Thr Tyr Leu Val Ser Asp Cys Pro Ser Pro 

1117 65 70 75 
1118 

1119 GGG CAG GAA ACA GTC TGC GAG CTC TCT CAT AAA GGC ACC TTT ACA 514 

1120 Gly Gin Glu Thr Val Cys Glu Leu Ser His Lys Gly Thr Phe Thr 

1121 80 85 90 
1122 

112 3 GCT TCG CAG AAC CAC GTC AGA CAG TGT CTC AGT TGC AAG ACA TGT 55 9 

1124 Ala Ser Gin Asn His Val Arg Gin Cys Leu Ser Cys Lys Thr Cys 

1125 95 100 105 
1126 

1127 CGG AAA GAA ATG TTC CAG GTG GAG ATT TCT CCT TGC AAA GCT GAC 604 

1128 Arg Lys Glu Yet Phe Gin Val Glu lie Ser Pro Cys Lys Ala Asp ^ 

1130 11 a^^U^J^^^ 

— > 1131 ATG GAC ACC GTG TGT GGC TGC AAG (HAG Ajfcte CAA TTC CAG CGC TAC 649 

1132 Met Asp Thr Val Cys Gly Cys Lys Lys Asm Gin Phe Gin Arg Tyr 

1133 125 130/ ,135 

1134 +p^U Vy^K-<L^U JOJCMaV 

1135 CTG AGT GAG ACG CAT TTC CAG TGT GTG GAC TGC AGC CCC TGC TTC 694 

1136 Leu Ser Glu Thr His Phe Gin Cys Val Asp Cys Ser Pro Cys Phe 

1137 140 145 150 
1138 

1139 AAT GGC ACC GTG ACA ATC CCC TGT AAG GAG AAA CAG AAC ACC GTG 73 9 

1140 Asn Gly Thr Val Thr lie Pro Cys Lys Glu Lys Gin Asn Thr Val 

1141 155 160 165 
1142 

114 3 TGT AAC TGC CAC GCA GGA TTC TTT CTA AGC GGA AAT GAG TGC ACC 784 

1144 Cys Asn Cys His Ala Gly Phe Phe Leu Ser Gly Asn Glu Cys Thr 

1145 170 175 180 
1146 



PAGE: 15 RAW SEQUENCE LISTING DATE: 03/22/1999 

PATENT APPLICATION US/08/383,676 TIME: 13:50:56 

INPUT SET: S3 1123. raw 
114 7 CCT TGC AGC CAC TGC AAG AAA AAT CAG GAA TGT ATG AAG CTG TGC 82 9 

1148 Pro Cys Ser His Cys Lys Lys Asn Gin Glu Cys Met Lys Leu Cys 

1149 185 190 195 
1150 

1151 CTA CCT CCA GTT GCA AAT GTC AC A AAC CCC CAG GAC TCA GGT ACT 874 

1152 Leu Pro Pro Val Ala Asn Val Thr Asn Pro Gin Asp Ser Gly Thr 
1153 
1154 

— > 1155 



CTG TTG CCT CTG GTT ATC TTC CTA GGT CTT TGC CTT TTA 919 /QJbzSJ+)~ 



1156 Ala Val Leu Leu Pro Leu Val lie Phe Leu Gly Leu Cys Leu Leu 

1157 215 220 225 
1158 

115 9 TTC TTT ATC TGC ATC AGT CTA CTG TGC CGA TAT CCC CAG TGG AGG 964 

1160 Phe Phe lie Cys lie Ser Leu Leu Cys Arg Tyr Pro Gin Trp Arg 

1161 230 235 240 
1162 

116 3 CCC AGG GTC TAC TCC ATC ATT TGT AGG GAT TCA GCT CCT GTC AAA 1009 

1164 Pro Arg Val Tyr Ser lie lie Cys Arg Asp Ser Ala Pro Val Lys 

1165 245 250 255 
1166 

1167 GAG GTG GAG GGT GAA GGA ATT GTT ACT AAG CCC CTA ACT CCA GCC 1054 

1168 Glu Val Glu Gly Glu Gly lie Val Thr Lys Pro Leu Thr Pro Ala 

1169 260 265 270 
1170 

1171 TCT ATC CCA GCC TTC AGC CCC AAC CCC GGC TTC AAC CCC ACT CTG 1099 

1172 Ser lie Pro Ala Phe Ser Pro Asn Pro Gly Phe Asn Pro Thr Leu 

1173 275 280 285 
1174 

1175 GGC TTC AGC ACC ACC CCA CGC TTC AGT CAT CCT GTC TCC AGT ACC 1144 

1176 Gly Phe Ser Thr Thr Pro Arg Phe Ser His Pro Val Ser Ser Thr 

1177 290 295 300 
1178 

117 9 CCC ATC AGC CCC GTC TTC GGT CCT AGT AAC TGG CAC AAC TTC GTG 1189 

1180 Pro He Ser Pro Val Phe Gly Pro Ser Asn Trp His Asn Phe Val 

1181 305 310 315 
1182 

118 3 CCA CCT GTA AG A GAG GTG GTC CCA ACC CAG GGT GCT GAC CCT CTC 1234 

1184 Pro Pro Val Arg Glu Val Val Pro Thr Gin Gly Ala Asp Pro Leu 

1185 320 325 330 
1186 

1187 CTC TAC GGA TCC CTC AAC CCT GTG CCA ATC CCC GCC CCT GTT CGG 127 9 

1188 Leu Tyr Gly Ser Leu Asn Pro Val Pro He Pro Ala Pro Val Arg 

1189 335 340 345 
1190 

1191 AAA TGG GAA GAC GTC GTC GCG GCC CAG CCA CAA CGG CTT GAC ACT 1324 

1192 Lys Trp Glu Asp Val Val Ala Ala Gin Pro Gin Arg Leu Asp Thr 

1193 350 355 360 
1194 

1195 GCA GAC CCT GCG ATG CTG TAT GCT GTG GTG GAT GGC GTG CCT CCG 136 9 

1196 Ala Asp Pro Ala Met Leu Tyr Ala Val Val Asp Gly Val Pro Pro 

1197 365 370 375 
1198 

119 9 ACA CGC TGG AAG GAG TTC ATG CGG CTC CTG GGG CTG AGC GAG CAC 1414 



PAGE: 16 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383, 676 



DATE: 03/22/1999 
TIME: 13:50:57 



— > 



1200 
1201 
1202 
1203 
1204 
1205 
1206 
1207 
1208 
1209 
1210 
1211 
1212 
1213 
1214 
1215 
1216 
1217 
1218 
1219 
1220 
1221 
1222 
1223 
1224 
1225 
1226 
1227 
1228 
1229 
1230 
1231 



INPUT SET: S3U23.mw 
Thr Arg Trp Lys Glu Phe Met Arg Leu Leu Gly Leu Ser Glu His 
380 385 390 

GAG ATC GAG CGG TTG GAG CTG CAG AAC GGG CGT TGC CTC CGC GAG 1459 
Glu lie Glu Arc Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu 
395 400 405 

GCT CAT TAC AGC ATG CTG GAA GCC TGG CGG CGC CGC ACA CCG CGA 1504 
Ala His Tyr Ser Met Leu Glu Ala Trp Arg Arg Arg Thr Pro Arg 
410 415 420 

CAC GAG GCC ACG CTG GAC GTA GTG GGC CGC GTG CTT TGC GAC ATG 1549 
His Glu Ala Thr Leu Asp Val Val Gly Arg Val Leu Cys Asp Met 
425 430 435 

AAC CTG CGT GGC TGC CTG GAG AAC ATC CGC GAG ACT CTA GAA AGC 1594 
Asn Leu Arg Gly Cys Leu Glu Asn lie Arg Glu Thr Leu Glu Ser 
440 445 450 



CCT GCC CAC TCG TCC ACG ACC CAC CTC CCG CGA TAAGGCCACA 
Pro Ala His Ser Ser Thr Thr His Leu Pro Arg 
455 460 



1637 



CCCCCACCTC 
GGGAAGTGTG 
GAGGTGATTG 
CAAGGACCCA 
CTTTTCACAG 
GATGAACGGT 
CTCCAGCTGG 
AAAAAAAAAA 



AGGAACGGGA 
CCATCTGCTC 
TGGAGAAAAA 
AACAAAGCTT 
TAGATAAAAC 
TGAACTCCCT 
AGCCCCCGAC 
AAAAAAAAAA 



CTCGAAGGAC 
CCAGACAGCT 
GCACAATCTA 
CTCAGGGCCT 
AGTCTTTGTA 
AAGGTAGGGG 
TCTTGTAAAT 
AAAAAAAAAG 



CATCCTGCTA 
GAGGGTGCCA 

TCTGATACCC 
CCTCAGTTGA 
TTGATTATAT 
CAAGCACAGA 
ACACTAAAAA 
GAATTC 



GATGCCCTGC 
AAAGCCAGGA 

ACTTGGGATG 
TTTCTGGGCC 
CACACTAATG 
ACAGTGGGGT 
TCTAAAAGTG 



1687 
1887 

1937 
1987 
2037 
2087 
2137 
2173 



TTCCCTGTGA ACCTCCT 



1232 (2) INFORMATION FOR SEQ ID NO: 69: 
1233 

1234 (i) SEQUENCE CHARACTERISTICS: 

— > 1235 (A) LENGTH: 461 amino acids 

— > 1236 (B) TYP tfTpb lypep t i de\ 

1237 ^ _ 

— > 1238 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

1239 



1240 


Met 


Gly 


Leu 


Pro 


He 


Val 


Pro 


Gly 


Leu 


Leu 


Leu 


Ser 


Leu 


Val 


Leu 


1241 










5 










10 










15 


1242 


Leu 


Ala 


Leu 


Leu 


Met 


Gly 


He 


His 


Pro 


Ser 


Gly 


Val 


Thr 


Gly 


Leu 


1243 










20 










25 










30 


1244 


Val 


Pro 


Ser 


Leu 


Gly 


Asp 


Ara 


Glu 


Lys 


Arg 


Asp 


Asn 


Leu 


Cys 


Pro 


1245 










35 










40 










45 


1246 
































1247 


Gin 


Gly 


Lys 


Tyr 


Ala 


His 


Pro 


Lys 


Asn 


Asn 


Ser 


He 


Cys 


Cvs 


Thr 


1248 










50 










55 










60 


1249 


Lys 


Cys 


His 


Lys 


Gly 


Thr 


Tyr 


Leu 


Val 


Ser 


Asp 


Cys 


Pro 


Ser 


Pro 


1250 










65 










70 










75 


1251 


Gly 


Gin 


Glu 


Thr 


Val 


Cys 


Glu 


Leu 


Ser 


His 


Lys 


Gly 


Thr 


Phe 


Thr 



PAGE: 17 RAW SEQUENCE LISTING DATE: 03/22/1999 

PATENT APPLICATION US/08/383,676 TIME: 13:50:57 

INPUT SET: S31123.mw 



i ICO 










Q ft 

o U 










Q C 
t$3 










Q ft 


1 o c o 


Ala 


Ser 


Gin 


Asn 


TT i „ 

hi s 


vai 


Arg 


Gin 


Cys 


Leu 


C* A V 

•ber 


Cys 


Lys 


rpU v 

inr 


Cys 


1 Z 34 










y s 










1UU 










1 ft c 
1 y 3 


i o c c 

lz55 


ft 

Arg 


Lys 


Glu 


Yet 


Pne 


Gin 


vai 


Glu 


lie 


Ser 


Pro 


Cys 


Lys 


Aia 


Asp 


i o c c 










11 u 










113 










1 OA 

X z u 


1257 


Met 


Asp 


Thr 


Val 


Cys 


Gly 


Cys 


Lys 


Lys 


Asn 


Gin 


Pne 


Gin 


Arg 


Tyr 


TOCO 

1258 










i o c 

125 










130 










IOC 

135 


1259 


Leu 


Ser 


Glu 


Thr 


His 


Pne 


Gin 


Cys 


Val 


Asp 


Cys 


Ser 


Pro 


Cys 


Pne 


1 1 C A 

1260 










1 i A 

140 










145 










1 C A 

150 


1261 


Asn 


Gly Thr 


Val 


Thr 


He 


Pro 


Cys 


Lys 


Glu 


Lys 


Gin 


Asn 


Thr 


Val 


Izoz 










ICC 

13 3 










1 C ft 

1 o 0 










ICC 
1 O 3 


i o o 

1263 


Cys 


Asn 


Cys 


His 


ft "1 „ 

Ala 


Gly 


Pne 


Pne 


Leu 


Ser 


Gly 


Asn 


Glu 


Cys 


Thr 












1 *7 ft 
1 /U 










1/3 










1 O ft 

1 oO 


1265 


Pro 


Cys 


Ser 


His 


Cys 


Lys 


Lys 


Asn 


Gin 


Glu 


Cys 


Met 


Lys 


Leu 


Cys 


1266 










1 ft r 

185 










190 










195 


1267 


Leu 


Pro 


Pro 


Val 


Ala 


Asn 


Val 


Thr 


Asn 


Pro 


Gin 


Asp 


Ser 


Gly 


Thr 


1268 










200 










205 










210 


1269 


Ala 


Val 


Leu 


Leu 


Pro 


Leu 


Val 


He 


Phe 


Leu 


Gly 


Leu 


Cys 


Leu 


Leu 


1270 










215 










220 










225 


1271 


Phe 


Phe 


He 


Cys 


He 


Ser 


Leu 


Leu 


Cys 


Arg 


Tyr 


Pro 


Gin 


Trp 


Arg 


1272 










230 










235 










240 


1273 


Pro 


Arg 


Val 


Tyr 


Ser 


He 


He 


Cys 


Arg 


Asp 


Ser 


Ala 


Pro 


Val 


Lys 


1274 










245 










250 










255 


1275 


Glu 


Val 


Glu Gly 


Glu 


Gly 


He 


Val 


Thr 


Lys 


Pro 


Leu 


Thr 


Pro 


Ala 


1276 










260 










265 










270 


1277 


Ser 


He 


Pro 


Ala 


Phe 


Ser 


Pro 


Asn 


Pro 


Gly 


Phe 


Asn 


Pro 


Thr 


Leu 


1278 










275 










280 










285 


1279 


Gly Phe 


Ser 


Thr 


Thr 


Pro 


Arg 


Phe 


Ser 


His 


Pro 


Val 


Ser 


Ser 


Thr 


1280 










290 










295 










300 


1281 


Pro 


He 


Ser 


Pro 


Val 


Phe 


Gly 


Pro 


Ser 


Asn 


Trp 


His 


Asn 


Phe 


Val 












o a c 
305 










310 










O 1 C 

3 15 


1283 


Pro 


Pro 


Val 


Arg 


Glu 


Val 


Val 


Pro 


Thr 


Gin 


Gly 


Ala 


Asp 


Pro 


Leu 


1284 










320 










325 










330 


1285 


Leu 


Tyr 


Gly 


Ser 


Leu 


Asn 


Pro 


Val 


Pro 


He 


Pro 


ft T _ 

Ala 


Pro 


Val 


Arg 


lzob 










J Jo 










"5 A ft 

J40 










O A C 

J45 


1287 


Lys 


Trp 


Glu 


Asp 


Val 


Val 


Ala 


Ala 


Gin 


Pro 


Gin 


Arg 


Leu 


Asp 


Thr 


1 *i o o 










OCA 

350 










o c c 
333 










O /T A 

360 


1289 


Ala 


Asp 


Pro 


Ala 


Met 


Leu 


Tyr 


Ala 


Val 


Val 


Asp 


Gly 


Val 


Pro 


Pro 


1 O Q ft 

i z y u 










O £ C 

355 










OTA 

370 










375 


1291 


Thr 


Arg 


Trp 


Lys 


Glu 


Pne 


Met 


Arg 


Leu 


Leu 


Gly 


Leu 


Ser 


Glu 


His 


1 1 o o 

iz yz 










O O A 

380 










O O C 

385 










O Q A 

3 90 


12 93 


Glu 


He 


Glu 


Arc 


Leu 


Glu 


Leu 


Gin 


Asn 


Gly 


Arg 


Cys 


Leu 


Arg 


Glu 


1294 










395 










400 










405 


1295 


Ala 


His 


Tyr 


Ser 


Met 


Leu 


Glu 


Ala 


Trp 


Arg 


Arg 


Arg 


Thr 


Pro 


Arg 


1296 










410 










415 










420 


1297 


His 


Glu 


Ala 


Thr 


Leu 


Asp 


Val 


Val 


Gly 


Arg 


Val 


Leu 


Cys 


Asp 


Met 


1298 










425 










430 










435 


1299 


Asn 


Leu 


Arg 


Gly 


Cys 


Leu 


Glu 


Asn 


He 


Arg 


Glu 


Thr 


Leu 


Glu 


Ser 


1300 










440 










445 










450 


1301 


Pro 


Ala 


His 


Ser 


Ser 


Thr 


Thr 


His 


Leu 


Pro 


Arg 










1302 










455 










460 












1303 

































1412 (2) INFORMATION FOR SEQ ID NO: 71: 



PAGE: 18 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383 ,676 



DATE: 03/22/1999 
TIME: 13:50:58 



INPUT SET: S3 1123. raw 

1413 

1414 (i) SEQUENCE CHARACTERISTICS: 

— > 1415 (A) LENGTH: 340 amwto-^eids 

— > 1416 (B) TYPE ^polypeptide J ) 

1417 

— > 1418 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:71: 

1419 



1420 


Met 


Gly 


Leu 


Ser 


Thr 


Val 


Pro 


Asp 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Leu 


1 A O 1 

14 2 1 










5 










1 A 

1 u 










i r 

1 D 


1422 


Leu 


Glu 


Leu 


Leu 


Val 


Gly 


He 


Tyr 


Pro 


Ser 


Gly 


Val 


He 


Gly 


Leu 


14 2 3 










o a 

20 










25 










30 


1424 


Val 


Pro 


His 


Leu 


Gly 


Asp 


Arg 


Glu 


Lys 


Arg 


Asp 


Ser 


Val 


Cys 


Pro 


14 25 










35 










A A 

40 










45 


1426 
































14 27 


Gin 


Gly 


Lys 


Tyr 


lie 


His 


Pro 


Gin 


Asn 


Asn 


Ser 


He 


Cys 


Cys 


Thr 


1428 










50 










55 










60 


1 A 1 O 

14 2 ? 


Lys 


Cys 


TT X — 

rllS 


Lys 


Giy 


inr 


Tyr 


Leu 


Tyr 


Asn 


Asp 


Cys 


Pro 


taiy 


Pro 


1430 










65 










70 










75 


14 31 


Gly 


Gin 


Asp 


Thr 


Asp 


Cys 


Arg 


Glu 


Cys 


Glu 


Ser 


Gly Ser 


Pne 


Thr 


1432 










80 










85 










90 


1433 


Ala 


Ser 


Glu 


Asn 


His 


Leu 


Arg 


His 


Cys 


Leu 


Ser 


Cys 


Ser 


Lys 


Cys 


14 34 










95 










100 










105 


14 35 


Arg 


Lys 


Glu 


Met 


Gly 


Gin 


val 


Glu 


lie 


Ser 


Ser 


Cys 


Thr 


val 


Asp 


1436 










110 










115 










120 


14 37 


Arg 


Asp 


Thr 


val 


Cys 


Gly 


Cys 


Arg 


Lys 


Asn 


Gin 


Tyr 


Arg 


His 


Tyr 


1438 










125 










130 










135 


14 39 


Trp 


Ser 


Glu 


Asn 


Leu 


Phe 


Gin 


Cys 


Pne 


Asn 


Cys 


Ser 


Leu 


Cys 


Leu 


1440 










140 










145 










150 


1441 


Asn 


Gly 


Thr 


Val 


His 


Leu 


Ser 


Cys 


Gin 


Glu 


Lys 


Gin 


Asn 


Thr 


Val 


1442 










155 










160 










165 


1443 


Cys 


Thr 


Cys 


His 


Ala 


Gly 


Pne 


Pne 


Leu 


Arg 


Glu 


Asn 


Glu 


Cys 


Val 


1444 










170 










175 










180 


1 A A C 

144 3 


Ser 


Cys 


C! a v 


Asn 


Cys 


Lys 


Lys 


Ser 


Leu 


CjIU 


Cys 


Thr 


Lys 


Leu 


Cys 


1446 










185 










190 










195 


1 A A T 

144 / 


Leu 


Pro 


Gin 


lie 


Glu 


Asn 


vai 


Lys 


Giy 


Thr 


G1U 


Asp 


Ser 


Giy 


Thr 


1 A A O 

14 4 o 










O A A 

200 










O rt c 

2 0 5 










OTA 
210 


1 A. A Q 




Vdl 


Leu 


Leu 




Leu 


vdl 


lie 




It IlC 


vj _l y 


Leu 


Cys 


Leu 


Leu 


1450 










215 










220 










225 


1451 


Ser 


Leu 


Leu 


Phe 


He 


Gly 


Leu 


Met 


Tyr 


Arg 


Tyr 


Gin 


Arg 


Trp 


Lys 


1452 










230 










235 










240 


1453 


Ser 


Lys 


Leu 


Tyr 


Ser 


He 


Val 


Cys 


Gly 


Lys 


Ser 


Thr 


Pro 


Glu 


Lys 


1454 










245 










250 










255 


1455 


Glu 


Gly 


Glu 


Leu 


Glu 


Gly 


Thr 


Thr 


Thr 


Lys 


Pro 


Leu 


Ala 


Pro 


Asn 


1456 










260 










265 










270 


1457 


Pro 


Ser 


Phe 


Ser 


Pro 


Thr 


Pro 


Gly 


Phe 


Thr 


Pro 


Thr 


Leu 


Gly 


Phe 


1458 










275 










280 










285 


1459 


Ser 


Pro 


Val 


Pro 


Ser 


Ser 


Thr 


Phe 


Thr 


Ser 


Ser 


Ser 


Thr 


Tyr 


Thr 


1460 










290 










295 










300 


1461 


Pro 


Gly 


Asp 


Cys 


Pro 


Asn 


Phe 


Ala 


Ala 


Pro 


Arg 


Arg 


Glu 


Val 


Ala 


1462 










305 










310 










315 


1463 


Pro 


Pro 


Tyr 


Gin 


Gly 


Ala 


Asp 


Pro 


He 


Leu 


Ala 


Thr 


Ala 


Leu 


Ala 


1464 










320 










325 










330 


1465 


Ser 


Asp 


Pro 


He 


Pro 


Asn 


Pro 


Leu 


Gin 


Lys 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:58 



INPUT SET: S31123.raw 

1466 335 340 

1467 

1468 




1469 (2) INFORMATION FOR SEQ ID NO: 72: 
1470 

1471 (i) SEQUENCE CHARACTERISTICS: 

1472 (A) LENGTH: 19 bases 

1473 (B) TYPE:nucleic acid 

1474 (C) STRANDEDNESS: single 

1475 (D) TOPOLOGY: linear 
1476 

1477 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
1478 

14 7 9 GTACTTGAAC TCGTTCCTG 19 

1480 

1481 

1482 

1483 

1484 

1485 

1486 

1487 



PAGE: 1 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:58 



INPUT SET: S31123.raw 



Line Error Original Text 

9 Number of Sequences (64) Doesn't Equal Actual Count (71) (iii) NUMBER OF SEQUENCES:64 

26 Wrong application Serial Number (A) APPLICATION NUMBER:08/484,312 

42 Entered (1365) and Calc. Seq. Length (0) differ (A) LENGTH: 1365 bases 

54 Entered (483) and Calc. Seq. Length (0) differ (A) LENGTH :483 bases 

66 Entered (455) and Calc. Seq. Length (0) differ (A) LENGTH :455 amino acids 

67 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

69 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

69 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

137 Entered (161) and Calc. Seq. Length (0) differ (A) LENGTH: 161 amino acids 

138 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

140 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

140 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

168 Entered (157) and Calc. Seq. Length (0) differ (A) LENGTH: 157 bases 

1 80 Entered ( 1 3) and Calc. Seq. Length (0) differ (A) LENGTH : 1 3 amino acids 

181 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

183 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

183 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

191 Entered (1 1) and Calc. Seq. Length (0) differ (A) LENGTH:! 1 amino acids 

192 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

194 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

194 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

202 Entered (12) and Calc. Seq. Length (0) differ (A) LENGTH: 12 amino acids 

203 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

205 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

205 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

2 1 3 Entered ( 1 3) and Calc . Seq. Length (0) differ (A) LENGTH : 1 3 amino acids 

214 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

216 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

216 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

224 Entered (14) and Calc. Seq. Length (0) differ (A) LENGTH: 14 amino acids 

225 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

227 Wrong Sequence Number (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

227 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

227 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

232 Sequence * missing (12) INFORMATION FOR SEQ ID NO: 1 1 : 

235 Entered (15) and Calc. Seq. Length (0) differ (A) LENGTH: 15 amino acids 

236 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

238 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 : 

238 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 : 

246 Entered (13) and Calc. Seq. Length (0) differ (A) LENGTH: 13 amino acids 

247 Wrong or Missing Sequence Type (B) TYPE polypeptide 

249 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

249 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

257 Entered ( 1 3) and Calc. Seq. Length (0) differ (A) LENGTH: 1 3 amino acids 

258 Wrong or Missing Sequence Type (B) TYPE:polypeptide 

260 Wrong Or Missing Strandedness Value (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

260 Wrong or Missing Sequence Topology (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

268 Entered (20) and Calc. Seq. Length (0) differ (A) LENGTH :20 amino acids 



PAGE: 



2 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:58 



INPUT SET: S3 1 1 23. raw 



Line 


Error 


Original Text 


269 


Wrong or Missing Sequence Type 


(B) TYPE: polypeptide 


271 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 


271 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 


281 


Entered (19) and Calc. Seq. Length (0) differ 


(A) LENGTH: 19 amino acids 


282 


Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


284 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 


284 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 


294 


Entered (18) and Calc. Seq. Length (0) differ 


(A) LENGTH: 18 amino acids 


295 


11 T 'KM'' O rp 

Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


297 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 


297 


11 T m M ' ' «*1 FT* 1 

Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 


307 


Entered (14) and Calc. Seq. Length (0) differ 


(A) LENGTH: 14 amino acids 


308 


11 T » »• • f"l rp 

Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


310 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 


310 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 


318 


Entered (14) and Calc. Seq. Length (0) differ 


(A) LENGTH: 14 amino acids 


319 


it t l i • * « ft* 

Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


321 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 


321 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 


329 


Entered (13) and Calc. Seq. Length (0) differ 


(A) LENGTH: 13 amino acids 


330 


XX T X\ M * rri 

Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


332 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 


332 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 


484 


Entered (14) and Calc. Seq. Length (0) differ 


(A) LENGTH: 14 amino acids 


485 


Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


487 


xx 7 a m m * * , it * * i 

Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 


487 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 


495 


Entered (14) and Calc. Seq. Length (0) differ 


(A) LENGTH: 14 amino acids 


496 


11 r l j' 1 «"H m 

Wrong or Missing Sequence Type 


/t\\ Ti/r»p i j * j 

(B) TYPE:polypeptide 


498 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 


498 


iir % m' * c\ HP 1 

Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 


650 


Entered (51) and Calc. Seq. Length (0) dirrer 


/ A \ I IT X T /"*< I TCI — * ^ " J „ 

(A) LENGTH :51 amino acids 


651 


\\T A A* o ~ HP 

Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


653 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 


653 


IT 7 A X ' O „ HP 1 

Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 


DO / 


cniereu ^i3<v ana L,aic. oeq. Lengm ^u^ airier 


\t\) LiiiNvj i ri. i jo oases 


765 


Entered (63) and Calc. Seq. Length (0) differ 


(A) LENGTH :63 bases 


111 


Entered (64) and Calc. Seq. Length (0) differ 


(A) LENGTH :64 bases 


813 


Entered (81) and Calc. Seq. Length (0) differ 


(A) LENGTH :81 bases 


873 


Entered (30) and Calc. Seq. Length (18) differ 


(A) LENGTH :30 bases 


885 


Entered (1334) and Calc. Seq. Length (1 134) differ 


(A) LENGTH: 1334 bases 


894 


# of Sequences for line conflicts w/ running total 


TAGCTGTCTG GC 


1000 


Entered (371) and Calc. Seq. Length (0) differ 


(A) LENGTH :371 amino acids 


1001 


Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


1003 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 


1003 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 


1059 


Entered (6464) and Calc. Seq. Length (50) differ 


(A) LENGTH :6464 bases 


1090 


Entered (2173) and Calc. Seq. Length (1727) differ 


(A) LENGTH :2 173 bases 
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SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/383,676 



DATE: 03/22/1999 
TIME: 13:50:59 



INPUT SET: S3 1123. raw 



Line 


Error 


Original Text 


1099 


# of Sequences for line conflicts w/ running total 


A TV* 1 /~*/~*T* /"IT"/"! A T/l rTP ripT i~*r*f~* /"'TV*' /^ r V/~^ f^T 

A 1 0 CjU I Lit CCC A 1 C U 1 0 CC 1 OOC LlOCluCI 


1 1 j i 


wrong rNUCieic /\ciu L/eMgnaior 


ATPw PAP Apr CYTC1 THT PPJP Tfif A AP PAH AtP PA 


1131 


# of Sequences for line conflicts w/ running total 


ATG GAC ACC GTG TGT GGC TGC AAG PAG AkC CA 


1155 


Wrong Nucleic Acid Designator 


GCC GIC CTG TTG CCT CTG GTT ATC TTC CTA GGT 


1155 


# of Sequences for line conflicts w/ running total 


GCC GIC CTG TTG CCT CTG GTT ATC TTC CTA GGT 


1224 


# of Sequences for line conflicts w/ running total 


GGGAAGTGTG CCATCTGCTC CCAGACAGCT GAGGG 


1235 


Entered (461) and Calc. Seq. Length (0) differ 


(A) LENGTH :461 amino acids 


1236 


Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


1238 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 


1238 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 


1415 


Entered (340) and Calc. Seq. Length (0) differ 


(A) LENGTH :340 amino acids 


1416 


Wrong or Missing Sequence Type 


(B) TYPE:polypeptide 


1418 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 


1418 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 



